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IN THE SPECIFICATION: 

Please replace the following paragraph: 
Paragraph beginning at page 1, line 3: 

This application is a continuation of U.S. Application No.09/370,295, filed 
August 9,1999 and now abandoned, which claims the benefit of U.S. Provisional 
Application No. 60/096,342, filed Auc(ust 12. 1998. 

REMARKS 

In the Office Action Claims 13-15, 19-22 and 24 are rejected under U.S.C.112, 
first paragraph as the specification has not been found to enable any person skilled in 
the art to which it pertains, or with which it is most nearly connected, to make and use 
the invention commensurate in scope with these claims. Rejection of Claims 13-15, 
19-22 and 24 is respectfully traversed on ground of the discussion below, which 
should also remove the examiner's objection to claims 16 and 17 on ground of their 
dependence on claim 13. A copy of Nosaka et al. (J. Biol. Chem. 268:17440-17447, 
1993), enclosed for the Examiner's convenience (copy previously submitted with the 
IDS dated February 25, 2002), describes the isolation and characterization of a 
thiamin pyrophosphokinase gene from yeast (gi: 6324717). The expression of the 
yeast enzyme in E. co//, which lacks thiamin pyrophosphokinase activity, showed 
marked activity of this enzyme in the prokaryotic cell. AppendijLA, enclosed herewith, 
shows a comparison of the claimed sequence with the yeast thiamin 
pyrophosphokinase and the mouse gene product (gi: 6468206), which share 18.7% 
and 24.3% sequence identity with the claimed sequence, respectively. All three 
sequences display essentially the same glycine-rich motif typical for nucleotide 
phosphate group binding and reminiscent of the Rossmann fold (Rossmann et a!., 
Nature, Vol: 250, 194-199, 1974, copy enclosed for Examiner's convenience). Apart 
from thiamin, thiamin pyrohosphokinase also uses ATP as a substrate. Thus, the 
existence of such a binding site would be expected. Recently the crystal structure of 
the mouse enzyme has been revealed, which provides further support for above 
discussion (Timm et al., JMB, Vol:310, 195-201, 2001, copy enclosed for Examiner's 
convenience). The crystallographic data of the mouse enzyme identified residues of 
the thiamin pyrophosphokinase active site. All of the residues identified are essentially 
conserved among all three sequences. For the Examiner's convenience those 
residues have been underlined In the alignment in Appendix A. 

This comparison demonstrates the sequence of the invention possesses 
stretches of highly conserved regions. One skilled in the art would appreciate that the 
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more highly conserved a residue is, the less likely that it could be modified and 
function maintained. From this alignment in the attached Appendix, one could quickly 
determine which amino acid residues might be modified in SEQ ID NO:2 without a 
likely change in function. Since SEQ ID NO:2 share 18.7% and 24.3% identity with 
the yeast and mouse protein, respectively, one of skill in the art would have 
appreciated that many variants sharing at least 80% sequence identity to the SEQ ID 
NO:2 would have been expected to retain Thiamin pyrophosphokinase activity. 

In view of the discussion above, one skilled in the art would have known, at 

the time the claimed invention was made, how to use the claimed sequence without 

undue experimentation and that the rejection according to 35 USC §1 12, first 

paragraph, should be removed. 

For the foregoing reasons. Applicants respectfully request reconsideration and 

allowance of the claims. 

Please charge any requisite fee to Deposit Account No. 04-1928 (E. I. du Pont 

de Nemours and Company). 



Respectfully submitted, 




REGISTRATION NO. 34,293 
TELEPHONE: (302) 992-4926 
FACSIMILE: (302) 892-1026 





APPENDIX A 



Comparison of the amino acid sequences of the Thiamin Pyrophosphokinases from 
corn clone contig with SID No's; (SEQ ID N0:2), yeast and mouse set forth NCBI General 
Identifier No. 6324717 and 6468206, respectively. Amino acids conserved among all 
three sequences are indicated with an asterix above the conserved residues, 
respectively. Dashes are used by the program to maximize alignment of the sequences. 
The likely nucleotide binding site (Rossmann et al., Nature, Vol: 250, 194-199, 1974) is 
boxed and residues involved in Thiamin binding (Timm et al., JMB, Vol:310, 195-201, 
2001) are underlined. 



SEQ_id_2 

gi-6324 717_yeast 
gi-6468206_mouse 



MLWRAVRSMDVIMHSSSFL LPKLyQPVNKPVKNYALWLNQHLP- - -RFMPRL 

MSEECIENPERIKIGTDLINIRNKMNLKELIHPNED--ENSTLLILNQKIDIPRPLFYKI 
ME-HAFTPLEPLLPTGNL KYCLWLNQPLD ARFRHL 



SEQ_id_2 

gi-6324717_yeast 
gi-64 68206_mouse 



WDHANLRICADGGANHIFDEMYQITNDEDKKSTRNKYVPEIIEGDMDSIRPEVKLFYSSQ 

WKLHDLKVCADGAANRL YD YL DDDETLR I KYL PNY 1 1 GDLD S L S E KVYKY YRKN 

WKKALLRACADGGANH LYDLTEGE RESFLPEFVSGDFDS IRPEVKEYYTKK 



SEQ_id_2 

gi-6324717_yeast 
gi- 64682 06_mouse 



GSKISDKSHNQETTDLHKCISRIHHCTPDDEKPNLC 

KVTII-KQTTQYSTDFTKCVNLISLHFNSPEFRSLISNKDNLQSNHGIELEKGIHTLYNT 
GCDLISTP-DQDHTDFTKCLQVLQRKIEEKELQVDV 



***** 



SEQ_id_2 

gi-6324717_yeast 
gi- 64682 06_mouse 



VLVT GALGGR FDHEAANINVLY- L FSDMRI VLLSDDCLIRLLP 

MTESLVFSKVTPISLLALGGIGGRFDQTVHSITQLYTLSENASYFKLCYMTPTDLIFLIK 
1 VTL GGLGGR FDQIMAS VNTLF - QATHITPVPI 1 1 IQKDSLI YLLQ 



* ** ** 



SEQ_id_2 

gi-6324717_yeast 
gi-6468206_mouse 



RTHR HELYIESSVEGPHCGLFPVGAPSTSTTTTGLKWNLSE-SKMRFGSMISTSNI 

KNGTLIEYDPQFRNTCIGN--CGLLPIGEATLVKETRGLKWDVKNWPTSWTGRVSSSNR 
PG-K HRLHVDTGMEGSWCGLIPVGQPCNQVTTTGLKWNLTN-DVLGFGTLVSTSNT 



SEQ_id__2 VQSEK-VTVESDADLLWTISL RNLT 

gi - 632471 7_yeas t FVGDNCCFIDTKDDI ILNVEI FVDKLIDFL 
gi-646820 6_tnous e YDGSGLVTVETDHPLLWTMAI KS-- 



doi:10.1006/imbi.2001.4727 available online at httpy/www.idealibrary.com on IBlf^i^J. Mol. Biol. (2001) 310, 195-204 
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Crystal Structure of Thiamin Pyrophosphokinase 

David E. Timm*, Jingyuan Liu, L.-J. Balcer and Robert A. Harris 

Department of Biochemistry Thiamin pyrophosphate (TPP) is a coenzyme derived from vitamin Bi 
and Molecular Biology, Indiana (thiamin). TPP synthesis in eukaryotes requires thiamin pjrrophosphoki- 
Uniyersity School of Medicine nase (TPK), which catalyzes the transfer of a pyrophosphate group from 
Indianapolis, IN 46202, USA ATP to thiamin. TPP is essential for. central metabolic processes, includ- 
ing the formation of acetyl CoA from glucose and the Krebs cycle. 
Deficiencies in human tliiamin metabolism result in beriberi and 
Wernicke encephalopathy. The crystal structure of mouse TPK was deter- 
mined by multiwavelength . anomalous diffraction at 2.4 A resolution, 
and the structure .of TPK complexed with thiamin has been refined at 
1.9 A resolution. The TPK polypeptide folds as an a/p-domain and a 
P-sandwich domain, which share a central ten-stranded mixed p-sheet. 
TPK subunits associate as a dimer, and thiamin is bound in the dimer 
interface. Despite lacking apparent sequence homology with other pro- 
teins, the a/p-domain resembles the Rossman fold and is similar to other 
kinase structures, including another pyrophosphokinase and a thiamin 
biosynthetic enzyme. Comparison of mouse and yeast TPK structures 
reveals differences that could be exploited in developing species-specific 
inhibitors of potential use as antimicrobial agents. 
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Introduction 

Humans lack the ability to synthesize thiamin 
and, therefore, require dietary sources of this com- 
pound, also known as vitamin B^. Thiamin is con- 
verted to thiamin pyrophosphate (TIfP) by thiamin 
pyrophosphokinase (TPK).' TPP is essentia for nor- 
mal carbohydrate utilization as a coenzyme for 
central metabolic functions,^ including the Krebs 
cycle as part of the a-ketoglutarate dehydrogenase 
complex and the conversion of pyruvate to acetyl 
CoA catalyzed by the pyruvate dehydrogenase 
complex. The importance of thiamin in human and 
animal diets has been appreciated for a long time. 
Thiamin deficiency in humans causes beriberi and 
Wernicke encephalopathy. Beriberi has been 
known for centuries as a potentially fatal disease of 
the Far East resulting in neurologic and cardio- 
vascular problems. Wernicke encephalopathy is a 
neurological condition often encountered in 



Abbreviations used: HPPK, hydxoxymethyldihydro- 
pterin pyrophosphokinase; MAD, multiwavelength 
anomalous diffraction; PEG, polyethylene glycol; 
PRPP, phosphoribosylpyrophosphate synthetase; 
TPK, thiamin pyrophosphokinase; TPP, thiamin 
pyrophosphate. 

E-mail address of the corresponding author: 
dtimm@iupui.edu 



chronic alcoholism due to reduced intake and 
uptake of dietary thiamin.^ Inhibition of TPK by 
ethanol may also contribute to this condition.^'^ 

The formation of TPP is ^necessary for centrally 
important metabolic processes,^ and the loss of 
TPK activity is lethal in yeast.^'^ In addition to 
functioning as a carrier of activated acyl groups in 
a-keto add dehydrogenases required for glucose 
metabolism, TPP is required for carbohydrate 
metabolism by transketolase, pyruvate decarboxy- 
lase and pyruvate oxidase. TPP is also a coenzyme 
for the branched-chain a-keto add dehydrogenase 
complexes and acetolactate synthase required for 
branched chain amino acid degradation and 
synthesis,^ respectively. TPP formation is also 
important in signaling and driving cellular trans- 
port of tiiiamin.^ TPP levels regulate mitochon- 
drial 

a-keto add dehydrogenases,^ and feedback inhi- 
bition of TPK by TPP^°-^^ may be a mechanism for 
maintaining proper TPP levels as a regulatory fea- 
ture of these central metabolic enz)anes. TPP also 
regulates transcriptional levels of thiamin biosyn- 
thetic enzymes in yeast and effects mating in fis- 
sion yeast.^'® Finally, TPP is the precursor of 
thiamin triphosphate, which may have non-cofac- 
tor effects on neurologic cells.^^ 
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TPK Crystal Structure 



TPK has been characterized using material iso- 
lated from naturaP°"^^'^^ and recombinant 
sources/'®'^^ TPK amino acid sequences are poorly- 
conserved between species, and database searches 
have revealed no sequence similarities to other 
kinases or other families of proteins. TPK from 
different species has subunit masses ranging from 
23-36 kDa associated as dimers in solution, 
TPK utilizes ATP and thiamin as substrates in a 
pjnrophosphate transfer reaction that requires Mg^*** 
and jproceeds by an ordered sequential mechan- 
ism.^^-^2 Crystallographic studies of TPK were 
initiated to provide an improved understanding of 



the enzyme and aspects of its structure and func- 
tion at die molecular level 



Results and Discussion 

Structure determination 

Recombinant mouse TPK was crystallized by 
vapor diffusion (see Materials and Methods), and 
the crystal structure was determined at 2.4 A 
(Figure 1(a)) from multiwavelength anomalous 
diffraction^^ data measured using selenomethio- 
nine substituted crystals (Table 1). Data collected 





Figure 1. Electron density maps, (a) Experimental electron density calculated at 2.4 A and contoured at 1.75 a is 
shown in stereo for residues to Gly225 to Ile241. (b) A 1.9 A omit map was calculated following a 5000 K simulated 
aimealing refinement using a model lacking thiarnin. The thiazole and pyrimidine rings are numbered according to 
use in Results and Discussion. The 2Fo - Fc (dark blue) and fo - (light blue) maps are contoured at 1.8 and 4.0 a, 
respectively. The Figure was created using O.^ 
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Table 1. Mouse IPK data collection, structure solution and refinement statistics 



MAD MAD MAD 

Inflection Peak Remote Iluamin 



Energy (keV) 


12.644 


12.648 


13.400 




Observations (total/unique) 


273,244/49,984 ■ 


282476/49,817 


284,364/50,110 


330,681/52,413 


Completeness (%) 


100.0(100.0) 


100-0(100.0) 


100.0(100.0) 


99.9(98.9) 


1/al 

Resoution 


0.069(0.465) 


0.063(0.337) 


0.069(0.508) 


0.073(0.406) 


22.9(3.7) 
30.0-2.4 


25.5(5.3) 
30.0-2.4 


24.1(3.5) 
30.0-2.4 


18.6(2.2) 
30.0-1.9 


Residues 




496 




496 


Water molecules 




255 




413 


Sulfate groups 




0 




14 


Thiamin molecules 




0 




2 






0.225(0.250) 




0.214(0.239) 


rmsd bonds (A) 




0.007 




0.008 


imsd angles (deg.) 




1.34 




1.43 


Ramachandran most favored (%) 




89.1 




90.8 


Ramachandran allowed (%) 




9.7 




8.7 



Energy in keV ^ 12.398/>. in A. Values for MAD data are from SCALER ACK with Bijvoet pairs kept separate in scaling. 
Rnwige = S|/ - <i)|/SI, where / is the integrated intensity of a given reflection. Nvimbers in parentheses represent the values obtained 
in the highest resolution bin (2.49-2.40 A for MAD; 1.97-1.90 for thiamin). The refinemeit residual, R = 'L\F^ - F^J/EF^* with 
the cross-validating calculated from 1314 and 2606 test reflections excluded from reflnements of the MAD and thiamin models, 
respectively. 



from crystals soaked with thiamin were used for 
subsequent refinement of the enzyme /substrate 
complex at 1.9 A (Figure 1(b)). The asymmetric 
unit of the TPK crystals contains two subunits. 
Electron density for the main chain is continuous 
for authentic residues in both subunits. However, 
disorder is indicated for the N-terminal His tags 
with nine residues missing from the electron den- 
sity for one subtmit and 19 missing from the other 
subunit. 



The TPK structure 

The TPK subimit structure can be described as 
an N-tenninal a/ P-domain and a C-terminal 
P-sandwich domain assembled about a central ten- 
stranded p-sheet (Figure 2). The first half of the 
central p-sheet is arranged as parallel P-strands 
sandwiched between a-helices to form a three- 
layered a/p/a sandwich. The second half is 
formed from a jelly-roll of antiparallel strands 
which gives rise to the second sheet of the two- 
layered P-sandwich. The overall dimensions of the 
subunits are about 56 A in length and 31 A in 
width. 

The TPK subunits associate as a dimer shaped 
Hke a diamond (Figure 2(c)) or a U (Figure 2(d)) 
when viewed parallel with or perpendicular to the 
non-crystallographic 2-fold axis, respectively. Resi- 
dues have been nimibered relative to the authentic 
start Met residue and end in A or B to distinguish 
between the two subimits. The dimeric association 
is mediated primarily by interactions between the 
P-sandwich domain of one subimit with both the 
a/p and P-sandwich domains of the opposite sub- 
unit. These contacts primarily involve the* loops 
between residues ^0207 and^ Pro211 and between 
,Giy23fl ^and Th rg^^THaT proti^^ towards the 
dimer interface from the opposing P-sandwich 



domains. Limited direct contact is also made 
between the a/P-domains in the vicinity of resi- 
dues Leul28 to Leul39 and their 2-fold related 
counterparts. Consistent with previous studies 
indicating a dimeric association in solution/°'^* the 
solvent-accessible surface area buried between the 
subimits is 2808 A^. Surprisingly, 11 residues of 
the His tag from one subimit extend away from 
the TPK dimer to make a seemingly urilikely lattice 
contact that forms a short two-stranded P-sheet 
centered about a 2-fold crystallographic axis. 



Thiamin-binding site 

The location of the TPK active site has been 
identified using data collected from crystals soaked 
with thiamin (Table 1; Figure 1(b)). The thiamin- 
binding site is located in tihe dimer interface distal 
to the non-crystallographic 2-fold axis in a cleft 
between the a/ P-domain of one subunit and the 
P-sandwich of the opposite subimit (Figure 2(c) 
and (d)). This birjding site is/formed by a P-bulge 
between ^r216B and AsnZlSB/ and loops between 
Gl5d99E.^ AsnSOaafand between As p95A and 
AspJi lO iFigure ^). The pyrimidine ring a^d 
methylene C atom stack parallel with the Trp202B 
indole ring (Figure 1(b)). The pjmmidine methyT 
group is in van der Waals contact with the side- 
chains of Leu204B and LeuJMB in the hydrophobic 
core of the p-sandwich. Hydrogen bond potential 
exists for the side-chains of Ser216B and Asn219B 
and the pyrimidine ring Nl' atom, 'while the 
Asp97A side-chain appears to accept a hydrogen 
bond from the amino group and participate in a 
water-mediated hydrogen bond to the N2' atom 
(Figure 3). The positive charge on the thiazole ring 
N3 atom may interact with partial negative charges 
on the main-chain O atoms of Thr21B and 
Gln96A located to either side. The thiazole methyl 




Figure 2(a)-(c) {legend opposite) 
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(d) 



Figure 2. The TPK structure, (a) A C" trace is shown in stereo for the motise TPK subunit. (b) A ribbon diagram 
illustrating the distribution of secondary structures within TPK is shown in stereo, (c) A ribbon diagram of the TPK 
dimer is viewed along the centrally located 2-fold non-crystallographic axis, (d) A ribbon diagram of the TPK dimer 
viewed perpendicular to the 2-fold non-crystallographic axis. The position of thiamin is represented in (c) and (d) as 
a ball-and-stick model with Q N, O and S atoms colored yellow, blue, red and green, respectively. This Figure and 
Figures 3 and 4 were created using MOLSCRBPT.^ 



group and C5 atom are also in van der Waals con- 
tact with the CH2 atom of Trp202B and the 
atom of Ser218B, respectively. The thiamin hydro- 
xyethyl group points in the direction of the non- 
crystallographic 2-fold axis into a solvent-accessible 
space that occurs in the a/p-domain between the 
parallel ^strands and the a-helices beginning at 
Gly47A, Thr99A; and Argl31A. 

Crystal structures of several TPP-dependent 
enzymes have been reported. ^''"^^ These include 
transketolase, pyruvate decarboxylase, pyruvate 
oxidase, benzoylformate decarboxylase and 
branched-chain a-keto acid dehydrogenase. Crystal 
structures have also been reported for thiamin 
phosphate synthase^^ and hydroxyethylthiazole 
kiniase,^ which are involved in microbial thiamin 
biosynthesis. Some general features are shared in 
common between TPK and certain TPP-dependent 
enzymes. The p5n:imidine ring tends to bind in a 
hydrophobic environment for most of these 
enzymes. Also, the intersubimit binding site and 
aromatic stacking of the pyrimidine ring in TPK 
are similar to interactions observed in transketolase 
and the a-ketoacid dehydrogenase. However, these 
features are not universal among all TPP-depen- 
dent enzymes, and many differences can be ident- 
ified. The TPP-dependent enzymes impose an 
energetically tmfavorable "V" conformation 
between the pyrimidine and thiazole rings that 
positions the amino group to activate 5ie C2 
carbanion.^ Thiamin phosphate synthase also 
imposes a V conformation, but places the thiazole 
methyl group in a ds position relative to the amino 
group. 

In contrast, TPK has no apparent functional gain 
from unfavorable steric arrangements of the two 
ring systems and represents the first example of 
thiamin binding to an enzyme in the low-energy 



"F" conformation. Thus, the highly conserved 
intercalation of a hydrophobic side-chain between 
the pyrimidine and thiazole rings in TPP-depen- 
dent enzymes and thiamin phosphate synthase is 
absent from TPK. Interactions of polar groups with 
both the pyrimidine aiid thiazole groups observed 
in the TPK/ thiamin structure ako vary signifi- 
cantly compared to other thiamin-binding 
enzymes. In particxilar, the conserved hydrogen 
bonding of a Glu side-chain with the pyrimidine 
Nl' atom believed to stabilize an intiino tautomer 
of N4' in TPP-dependent enzymes^ is absent from 
the TPK structure. 



Comparison to yeast TPK 

Mouse TPK has been compared to the structure 
of TPK from Saccharomyces cerevisiae to evaluate 
the conservation of structural features between 
enzymes sharing a 26 % sequence identity over 288 
residues. Details of the yeast TPK structure, con- 
taining thiamin and refined at 1.8 A to an K-f actor 
of 0.216 (Rfree = 0.234), will be reported elsewhere.. 
The yeast TPK amino acid sequence is 75 residues 
longer than mouse TPK. This difference in size is 
due to 20 additional N-terminal and seven 
additional C-terminal residues, a 38-residue inser- 
tion and several smaller insertions of three residues 
or less. Overall, the TPK dimers from these species 
superimpose with a root-mean-square deviation 
(rmsd) of 1.6 A for 406 C" atoms in 19 secondary 
structures (Figure 4(a)). The N-terminal extension 
in yeast TPK wraps around the p-sandwich 
domain to add an additional antiparaUel P-strand 
in close proximity to the thiamin binding site. The 
38 residue ii\sertion occurs at the end of an a^helix 
to replace a short loop between LjrsllS and Vall21 
in mouse TPK with a structure containing two a- 
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Figure 3. The TPK active site. A stereodiagram of the TPK active site is shown with potential electrostatic inter- 
actions described in Results and Discussion indicated by broken lines. The Trp202 side-chain, located in the right 
foreground of the Figure, has been omitted to maintain an imobstructed view. Atom colorii^ has been described in 
the legend to Figure 2. 



helices. This insertion packs against the a/p- 
donnain near the helix between 11^99 and LysllS 
and the edge of the parallel P-sheet in close proxi- 
mity to the loop contacting the thiazole ring 
between Asp95 and AsplOO. Many of the residues 
in the active site and contacts between thiamin and 
TPK are conserved, including the pyrimidine ring 
stacking against a Trp side-chain, the hydrogen 
bonding of a Ser side-chain to the Nl' atom, and 
the positioning of the thiazole N3 atom between 
main-chain oxygen atoms (Figure 3). However, the 
environment surrounding the pyrimidine methyl 
group in mouse TPK is formed by a Cys, a Trp 
and a.Val residue in yeast TPK, and the Asp inter- 
acting with the N3' and N4' atoms in mouse TPK 
is replaced by Tyr. 

Other similar structures 

While structures similar to both TPK domains 
are common in the Protein Data Bank (PDB), it is 
worth noting that these motifs are also present in 
other proteins of related function. * Structures 
resembling the TPK a/p-domain are present in 
another pyrophosphokinase and several small mol- 
ecule kiiiases. Repeats of a five-stranded parallel p- 



sheet flanked by helices on either side are found in 
the structure of phosphoribosylpyrophosphate 
(PRPP) synthetase.^* This enzyme also catalyzes a 
P5n:ophosphate transfer reaction and superimposes 
on moxise TPK with an rmsd of 2.0 A over 58 C* 
atoms present in five secondary structures 
(Figure 4(b)). Comparable levels of structural simi- 
larity are also foimd for other small molecule 
kinases, including adenosine kinase, ribokinase 
and hydroxyethylthiazole kinase (ThiK).^ Interest- 
ingly, ThiK is required for microbial biosynthesis 
of thiamin via a salvage pathway, and the active 
sites of TPK and ThiK occur in the same general 
vidnity following superposition. However, the 
substrate binding sites do not superimpose 
between the two structures. The hydroxyethyl thia- 
zole ring in ThiK is located about 9 A away from 
the thiamin thiazole ring in TPK, and the superim- 
posed ATP bound to ThiK clashes with the TPK 
loop between Asp95 and AsplOO. Besides PRPP, 
the only other known pyrophosphokinase structure 
of hydroxymethyldihydropterin pjnrophosphoki- 
nase (HPPK)^ shows only superficial similarity to 
TPK, with a three-layered a/p/a structure contain- 
ing a central antiparallei rather than a parallel P- 
sheet. Finally, elements of the TPK a/P-domain 
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Figure 4. TPK superpositions, (a) The mouse TPK dimer (blue) is superimposed with the yeast homolog (red). The 
view is onto the ''back side" of the active site with the structures rotated approximately 180° in the plane of the 
paper relative to the view shown in Figure 2(c). Thiamin molecules are colored as described in the legend to Figure 2. 
(b) The TPK a/p-domain (blue) is stqjerimposed on a similar structure present in phosphoribosylpyrophosphate 
synthetase (violet). 



also resemble the ubiquitous Rossmann fold pre- 
sent in many nucleotide-binding proteins.^^ 

Numerous structural homologs of the TPK 
P-sandwich domain are also represented in the 
PDB. The TPK jelly-roll motif is most similar to 
a nimiber of viral structural proteins and conca- 
navalins. Similarities between TPK and an epi- 
merase, RmlC/^ from Gram-negative bacteria 
may be of greater functional sigruficance. The 
superposition of RmlC and TPK gives an rmsd 
of 1.7 A for 39 atoms in six secondary 
structures. The pyrimidine ring of the nucleotide 
substrate of RmlC binds to the exposed 
edge of a p-sandwich, stacking against a Tyr 



side-chain in a manner similar to the pyrimidine 
binding to TPK. However, the comparison is 
again general in nature, and the RmlC binding 
site actually occurs on the opposite side of the 
P-sandwich relative to the superimposed TPK 
structure. 

C nclusfons 

The TPK structure provides a molecular frame- 
work for understanding the general mechaiusm 
and regulation of an essential enzyme and pyro- 
phosphate transfer reactions. This type of bio- 
chemical reaction is essential for many aspects of 
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living systems. In addition to thiamin metabolism, 
pyrophosphate transfer reactions are also essential 
for the biosjmthesis of amino acids, folates, purine 
and pyrimidine bases, and the nucleotides and 
coenz3nnes derived from these compounds. Recent 
reports of pyrophosphokinase structures provide 
an opportunity to further the understanding of this 
group of enzymes, which lags significantly behind 
traditional kinases that trai^fer orthophosphate. 

While additional studies of TPK are required to 
determine details of ATP and Mg^"*" binding, regu- 
lation by TPP and inhibition by ethanol, the TPK/ 
thiamin structure provides some insight into 
aspects of TPK function. A likel y ATP-hinding sitP 
is formed in the vicinity oTthiamin and the Argl31 
siae-chain . The high degree of solvent accessibility 
of this site is consistent with the relatively high 
value rgjorted for ATP in kinetic studies of 
TPK}^-^^ An interaction of the p and/or y phos- 
phate groups with the guanidinium group would 
place the P-phosphorus atom in a position suitable 
for in-line nucleophilic attack by the thiamin 
hydroxyl oxweiLatrai. The structure and sequence 
surrounding(^^ rgl3l) is consistent with this scen- 
ario. A stilfate ion is bound to the Argl31 side- 
chain located within 2.5 A of the thiamin hydroxyl 
(Figure 3). A nearby duster of Asp residues (46, 71, 
73 and 100) is likely to function along' with phos- 
phate groups and potentially the thiamin hydroxyl 
group in tiie coordination of the Mg^"*" cofactor 
required for catalytic activity. Asp side-chains also 
coordinate Mg^**" in many TPP-dependent enzymeis. 
Argl31 is at ^le N teiminus of ah a-helix preceded 
by a loop . containing a Gly-rich sequence 
(126-GGLGGR-131^ reminiscent of the Rossmann 
fold^^GXGXXG m otif that functions with the helix 
dipole to bind nucleotide phosphate groups. A 
Gly-rich motif (14-GA GKGT-19 ) also interacts with 
nucleotide boimcJto adenosine kinase.^ However, 
barring a large conformational change, the 13.5 A 
distance between thiamin and the nearest Gly resi- 
due in this loop makes it unlikely for the Gly resi- 
dues to directiy contact phosphate groups in TPK. 
This situation is similar to that of aldehyde 
dehydrogenase,^^ which revealed novel placement 
and use of the Rossmann fold in nucleotide bind- 
ing. It should also be noted that the Gly-rich motif, 
Argl31 and the Asp cluster are strictly conserved 
between TPKs from different species. However, the 
lack of sequence similarities and conservation of 
specific ligand interactions between TPK and other 
thiamin and nucleotide binding proteins suggests 
that rather than having an evolutionary rektion- 
ship, common structural motifs have simply been 
adapted in different ways to a common use. 

Finally, the results of these crystallographic stu- 
dies suggest that TPK should be considered as a 
target for the design of antimicrobial agents. TPK 
is essential for basic carbohydrate utilization; 
S. cerevisiae and Schizosaccharomyces pombe growth 
is dependent on TPK,^**^ and other 
pyrophosphokinases^*'^ are being targeted for 
similar purposes. Mouse TPK residues Asp97, 



Asp207 and Leu214 are substituted in yeast by 
Tyr, Trp and Arg, respectively. These non-conser- 
vative substitutions could be exploited in develop- 
ing spedes-specific thiamin analogs. Furthermore, 
the additional 
N-terminal P-strand present in yeast TPK places a 
sulfhydryl group in van der Waals contact with 
thiamin: While additional full-length TPK 
sequences will need to be determined to fully 
assess TPK as a target, the mouse enzyme shares 
91 % identity over 137 residues of a partial hirnian 
sequence, but only 38 % identity with the next clo- 
sest sequence available. Therefore, it seems likely 
that the level of variability between mouse and 
.yeast TPK will be similar to the levels between 
humans and pathogens. 

Materials and Methods 

Expression and crystallization 

Mouse TPK^^ has been cloned and expressed in Escher- 
ichia coli as a polyhistidine fusion protein using the 
vector pET28a. The fusion protein contains a 20 residue 
His tag for a total of 263 residues. TPK was purified 
using nickel-charged metal chelate chromatography 
(Pharmacia) and elution with imidazole, and is stored 
frozen at -80 °C in 150 mM NaQ. Crystals were grown 
at room temperature in the space group P3i21 having 
unit cell dimensions of 90.1 A x 90.1 A x 140.4 A using 
the hanging drop vapor diffusion method where equal 
volumes of an 11 mg/ml TPK solution were mixed with 
precipitant solutions containing 1.8-2.1 M ammonium 
sulfate, 0.1 M Hepes and 2-4% PEG400, pH 7.1. 

Structure determination and crystai soalcing 

The TPK crystal structure has been determined using 
the multiwavelength anomalous diffraction (MAD) 
method. Selenomethionine-substituted TPK was pro- 
duced by over-expressing the recombinant enzyme in a 
defined medium containing selenomethionine and using 
the methiorune auxotrophic strain, DL41(DE3). MAD 
data (Table 1) were collected at the Advanced Photon 
Source synchrotron beam line 19ID. The data were inte-. 
grated using HKL2000 and merged and scaled using 
SCALEPACK.^ The partial structure of seven selenium 
atoms was determined using the program SOLVE.^^ The 
Se parameters were further refined and crystallographic 
phases were calculated to 2.4 A using MLPHARE.^^ 
Further improvements of the experimental electron 
density map were achieved by density modification and 
2-fold non-crystallographic symmetry averaging 
implemented in DM.^^ Data from TPK crystals soaked in 
precipitant solution containing 10 mM thiamin for 
16 hours were collected using a Rigaku RU200 rotating 
anode X-ray generator and an R-axis He detector. These 
data were integrated using HKL vl.96.0 and merged and 
scaled using SCALEPACK.^ 

liA del buliding and refinement 

The initial SeMet-substituted TPK model containing 
494 amino add residues was built using the program 
O.^ This model was refined with cycles of water build- 
ing using REFMAC^^ and ARPP^^ with non<rystallo- 
graphic symmetry restraints and a bulk solvent 
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correction to yield a model having 66 water molecules 
and an ]?-factor of 0.26 {Rfg„ = 0.29). Manual rebuilding 
of the model, placement of additional water molecules 
and refinement using CNS^ vO.9 with non-aystallo- 
graphic s)anmetry restraints and a bulk solvent correc- 
tion yielded an 2^-jfactor of 0.225 {Rf^ = 0.250) at 2.4 A 
(Table 1). This model was used in three additional cycles 
of model building and refinement against 1.9 A data col- 
lected from the thiamin-soaked crystal. Structural super- 
positions were performed using TOP.^^ 

Coordinates 

Coordinates and structure factors have been deposited 
in the RCSB Protein Data Bank under the accession code 
1IG3. 
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Chemical and biological 
evolution of a nucleotide-bmdmg proteui 

Michael G. Rossraann, Dmo Moras & Kenneth W. Olsen 
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Three-dimensional alignment of the common nucleotide 
binding structure in dehydrogenases, kinases and flavodoxms 
permits the recognition of homologous amino acids when 
sequence comparisons alone would fail. Minimum base 
changes per codon can then be used to measure evolutionary 
distances which suggest that this structure was present 
during precellular evolution. 

A COMMON structural domain' whose function is to biiid 
nicotinamide adenine dinudeotide (NAD) has been found in 
lactate dehydrogenase (LDH)' '. in soluble ""alate dehydro- 
genase (sMDH)«. ill liver alcohol dehydrogenase (LADH) 
and in glyccfaldehyde-3-phosphate dehdrogerar: ,vi/x. DH). 
Rao and Rossmann' showed that the same stnicture was utilised 
to bind flavin mononucleotide (FMN) in fl-voUoxin-". They 
also showed that this structure consists of two smaller units the 
function of each being to bind a mononucleotide. Buehner 
et ai* proposed an evolutionary tree which traces the incor- 
poration and evolution of the mononudeotide-binding struc- 
ture in various dehydrogenases and in flavodoxin. Further 
data on the three-dimensional structures and ammo actd 
sequences of some dehydrogenases and flavodoxms are now 
available which permits the evaluation of the proposed evolu- 
tionary tree more precisely. We show here that the position 
and sequence of the nodes in this tree are consistent with the 
molecular data, and that a rough time scale for these events 
can be proposed. 

Independent support for the evolutionary tree comes from a 
study of redox potentials and the evolution of biol^ual 
electron transport"-", suggesting a common origin for NAD 
and flavin-binding proteins. Furthermore, structures sitnilar 
to the mononudeotide or dinudeotide-binding protein frag- 
ment have been found in phosphoglycerate kinase" (Bryant 
et o/." suggest an alternative solution) and tentatively in 
adenyl kinase". Both these enzymes need to bind AMP, ADP 
or ATP. Similarly a flavodoxin-like structure has been recog- 
nised tentativdy in rhodanese" whose biological function 
is not fully estoblished but is known to bind flavin FAD FMN 
and NAD. 

Structural alignments 

The nudeotide-binding protein discussed above consists of a 
paraltel sheet formed by three extended polypeptide strands. 
The first two are connected by an a-hdU. and the last two by 



either an a^hdix or another, less well defined, structure. In 
dehydrogenases the secondary structural features of the 
dinudeotide-binding fragment have been P'^; P^;, P^' 

BD, PE and PF within the p-pleated sheet, while the helices 
Sre laWled aB. aC, . . . (Fig. 1). This nomendature was r^ 
ttinally devised for the LDH structure*. Helices aB and aC 
?onn«t strands PA with PB and PB with pC respectively. 
TheyTre on the same side of the sheet. Along the po lypep idc 
cha?n the sequence PA. oB. PB. aC, pC forms the first (adenine 
Sucl« idVSing fragment, while the second (nicotinamide) 
^Scotide-bindini fragment is related by an -PP'^o'^'f--;^" 
Two-fold axis. The fold of each fragment has the same unique 
band. 




PD PE P\ 




Fig. 1 Diagrammatic representation of a dinucleotide-bmding 
protein. The amino termini of the strands in the P-pleated sheet 
ate closest to the viewer. The dinudeotide binds to the carDoxy 
termini of the strands in the sheet. 





Flo 2 Hydrogen bonding dia- 
gram of p-pleated »heet region 
in (top) dogfish apo-LDH and 
(bottom) lobster holo-GAPDR 
Amino acids with thick outlines 
have residues facing into hydro- 
phobic pockets l^jw^n Ji}? £■ 
pleated sheet and a hdicM. 
The hydrophobic character, oi 
these residues is stronsly con- 
served ffable 1). 



There is a greater cSnservation of the structure of the sheet 
reg'SnThrof'Ihe hdices. For examge in ^^^^^^ 

ISgJ'd'JSng falysis. '^^^;,,Z^nZ2-y^< 
paralld pleated sh«t «g.on can be s^n read.^ in ji? 

i^gT'^LMtr ^^^^ s^z 
s^„-i^ir»"5!i t« s i; 

sensitive method f determimng J "^^f^'JiSdc-bindini 
equivalent p-sheet regi ns of ^ C 

fragments. The atomic coordinates of "^x t 

atoms were taken t set up an appr ,h« 
superimpose the two molecules being ~""P^f TJ" 

S^. r^iJilSinJeJt: but nevertheless foil wing the sami 
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miel Comparison of the NAD-binding domain among various 
lauic • r dehydrogenases • 
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Horse LADH 
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Bouine GluDH 
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folding sequence could be equivalenced and incorporated into 
the refinement procedure. . ^^^^ 

The amino acid alignments shown n Tabic > a^e ^"S D 
on three-dimensional structure, while '^''"^""'^.0 
Zno acids has b^J-T,tl?roVlSi<^!^^^^^^^^ 

S Iti'SlL specificity ^o^^^^l^ 

ri?iiT^Ts;;a^i^^^^^^^^ 

Is SsV sin in a series of stetcographic diagrams. The C. 
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backbone of J'^.'if f'^S 

used asasiandardofcomparison inng. J. ^ 
with the <'<='^y<>~f^"rwiS phoS^h^^^^^^^ 
fragment with the MNN-binding fragment m LDH. 

Sequence homologies with GIuDH 

has been ^^/Yr in GAP^ 245 to 283 in 

comparing residues 1 to 38 GAH^^J j,,^ „ 

bovine CluDH corresponding to JA.^ and t^ 

SLT=?e a^ t.=o^"SrcharU of important 
any larger residue ^"''^^^^"f^f ^^^^^ of the glycine in 




Fig. 3 Superimposed dinucleotide- 

gWen for LDH only. Conespond- 
fng numbers for. the other com- 
pounds (shown " We 

S»n be. derived fro^.^r^oli 
Comparisons are with (a) OAPOH 
and (b) LAIJH. 




Fie. 4 Comparison of (a) phos- 
ohoKlyccralc kinase (PGK) and (h) 
Codoxin with LDH Con^^T 
lions used arc described in rig. -J. 
TSS btalkball is the Mg'* site in 
PGK. 





could only identify one truly ^^^l^'f" ZT^tolJ^^l 
peptide chain which compares with the known inaw u 
domains in other dehydrogenases. gAPDH 
The alignment shown m T^Wi^l J'' ^'^^^^ com- 
is different ^o-'.^^^' ^^^^^^^^ 

parisons given by Engel . one minimum base 

sequence. The alignment given here ^^^^VfSoH ITOTPOsed 
thances per codon with respect to lobster GAPDH as oppcweo 
W l IsTnd 1.53 for EngcPs relationships ^^^^"^1 
Engel aligns two GluDH secuences -"'^J,-'*"",^^^^^^^^ 

joSrsrcS^^^^^ 

E^ers alignments were based on doubtful statistical pro 
cedures. 



Measuring evoluti nary divergence 

A relative estimate of the time since the diverge!^ from^ 
common ancestor for node in an evolutionanr tr^) ^ 
obtained by comparing amino acids of '^^^''^^'^^^'.^^ 
Hash and Fitch'' " used the ammo acid ««l«n^ *° 
homologous alignments between proteins then the m«n 
change between the sequence can be measured by various 
3g procedures to determine evolutional distance. In th.. 
3(«cept for the GluDH comparisons) thrw^.mens.onal 
structure has been used to obtain alignments, because of its 



n ! 7 nr base changes or on the average \.o oasc tndiib^^ 
2;io5 If he Stuatgeneticcode is taken into account^ then the • 

i^'ge number of J^/XrT 

,o change any one amm°/^;^ f. ^coSered: this number 

Unie ste Ih. occ»rr«nc< <>' » =7™" S fi^, «i.h time 
evems. This measure IS unlikely 1 ~ """i '„,fc_„™efot 
?f no «,.reclion is made fo. back ^"""SSSSl 
possible toiation from lineaniy *' "Sr^ ,'„J. 
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Sfffering funciions of the two domains w.thm a s.ngte pojy 
peptide chain' impose different "tes of cvolutioti. Thus our 
Smatcs of divergence among dehydrogenases have been 
taken only within the nucleotide-bmding domains. 

A time scale for evolution 

The evolutionary tree suggested by Buehner et al* has been 
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expanded as shown in Fig. 6. Observed minimum base changes 
Sr arc also related to a probable t.me at each node 

The divergence of the mammalian line from Avcs can be placed 
Ik^ih i y 10" yr ago (node I. Fig. 6). from Arthropoda about 
^^ 0- vr aco Ldc 2. Fig. 6) and from Fungi about 1 .2 x I0» 
/Si fIc^)"-'' The presence of the NAD-binding 
Sn^Tn c iop asmic an? mitochondrial dehydrogenases 
fmolies that it originated more than 1.5 x 10" yr ago when 
SfSndria were possibly incorporated. . into proto- 
SikSotcs". Probably, however, this structure is a good deal 
oEs it would have been required in the earliest prokaiyotes 
older as it wouio n- . assumed that 

- ott;;^pTdc; an^ nillS^ ilcirhad gained the ability to 
S^^^en^'S tra^ steps and primitive copying processes 
Crprccen'ular -olution^^we might expect to find the 
««nAniicleotidc-b nd ng protein before 3.2 x lu yr ago. 
riraufrelsonSble to' assume that it was formed sometime 
after the age of the Earth 4.5 x lO" yr ago 

F guTc 6 shows that, the larger the age associated with the 
node Ihc greater is the minimum base change per codon 
consistent wUh a divergent evolutionary process from a single 
ommon ancestor. Tha? alternative "^^^^^ 
exist is however, evident in the structures of RNase anp 
stTohvkKroLl nuclease-. Nevertheless, it might be ant. c.pated 
thaVihe^Sk shown in Fig. I will frequently be found 

Se !he?e^i^^^^^^^^ f^^P^^^^^^ ^" and basic require, 

mem? for binding nucleotides. Examples might be amino acid 
mNA symh^^^^ ribosomal proteins and virus coat proteins^ 
The recogS of this structure by sequence homology or frorn 
X-rarsXture determinations may also give guidance as to 

UniveS Of Michigan, Ann Arbor, for information on he 

M oS^^^^^^^^^ scheme in Cio^riciiun. M P Jv^^^^^^^^ 

related amino acid sequence allocations before publication . 

SirW wJ^hank Dr Susan Taylor for information on son^ 

S shrd segments of the dogfish LDH amino ?cid sequo^^^^ 

SoSsColiUlakeandCarlBra^^^^ 

of PGK and LADH, respectively. We thank Drs A, Liijas, 
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Three abundance classes in HeLa cell 
messenger RNA 

J. O. Bishop. J. G. Mortom M. R<»1«Bh & Md»iUe Richardson 



^pprojt/marWy i5.0(X> different poly{A)<ontatmng 
RNA Tguences are present in HeLa cell cytoplasm 
The seq^nces are grouped in three distinct abundance 
classes. 

The amount of DNA per haploid genome ii» the higher 
Sfa^otes is kno>vn to be very If^^^' ^^^^^ 
nosed the question of how much of the DNA encoaes 
SSSa. Up to the present time few serious attempts tave 
been made^o answer this question. One P^^^J?^ ^een 
the difficulty experienced in atummg sufficiently high 

concentrations of RNA to ^^■^^-^y^'^^^^'ZJ^^ Z 
to completion. The discovery of poly(A) stretches m 
mRN>?-' has partially removed this difficulty since 
S>MA)<onfainin'g RNA can now be sep^^^ed from 
rRNA"' and much higher concentrations of mRNA can 
be attained. A second difficulty has been the existence of 
r^petitWe sequences in the DNA': the transcript of one 
K^titive sequence can hybridise wUh related sequences. 



producing falsely, ^j^h ^^^IIl^u^S^^^^^^^ 

?t" roLe" LaSSed non«^^^^^^ DNA sequences 
risottru-sU paUU.;^ a-caled ;^h an 

Srre°;rf r h'^d^r^xya^?^^^^^^ - 

S^wn t^Tat about 10% of "^""P^^'tL^Kn RNA'^^ 
represented in transcripts found m total bra«t» RNA 
LlSiilariy 0.9% of nonrepetitive Xenopus DtfA is reprt 
seated by transcripts in the mature oocyte . . 
An even more serious problem has been that whatever 

anS:^;:^ Sned. it hfd to be ^//f J r^ong 
w« matter how much RNA was added, or for now long 
Sie Sles '^re "realed. the possibility always remained 
Sat Sis of RNA were present at a lower concentration . 

b ssrsfq^rioSS 

Jay. U is n w possible to synthesis* a complementary DNA 
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The thiSO mutant of Saccharomyees cereviaiae 
(Nishimura, H„ Kawasaki, Y., Nosaka, K., Kaneko, Y., 
and Iwashima, A. (1991) J. BacterioL 173, 2716- 
2719) shows markedly reduced activity of thiamin 
pyrophosphokinase (TPK; EC 2.7.6.2). We have iso- 
lated a DNA fragment carrying the THISO gene from 
a yeast genomic library by its ability to complement 
constitutive synthesis of the thiamin-repressible acid 
phosphatase, encoded by the PH03 gene, of thiSO mu- 
tant cells. On the other hand, the thiSO locus was found 
to be located 3.3 centimorgans proximal to the ampS 
locus on the right arm of chromosome XV by genetic 
mapping analysis, and one more fragment bearing the 
THISO gene trailing SMP3 gene was obtained by the 
plasmid eviction method. The nucleotide sequence of 
the overlapped region between the two isolated DNAs 
contained an open reading frame of 957 base pairs, 
encoding a 319*amino acid polsneptide with a calcu- 
lated molecular weight of 36,616. When the intact 
THISO open reading frame was expressed as a fusion 
protein carrying three vector-encoded amino acids at 
its N terminus in Escherichia coli lacking TPK, 
marked TPK activity was detected in the procaryotic 
cells, proving that the THISO gene of S. cereviaiae 
encodes a structural gene of TPK. A gene disruption 
experiment demonstrated that the THISO gene was 
essential for growth, and therefore, revealed that TPK 
is the only enzyme capable of ssmthesizing thiamin 
pyrophosphate in yeast. Studies of Northern blot 
analysis and the enzyme assay demonstrated that the 
THISO gene expression is regulated mainly at the 
mRNA level by the intracellular thiamin pyrophos- 
phate and requires the positive regulatory factors en- 
coded by THI2 and THI3 genes. However, unlike thia- 
min-repressible acid phosphatase and the enzymes in- 
volved in thiamin synthesis of 5. cereviaiae^ TPK was 
found to be expressed constitutively at a low level and 
incompletely repressed by exogenous thiamin. 
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Thiamin pyrophosphate (TPP),^ a coenzyme form of vita- 
min Bi, functions in several enzymatic reactions in which 
TPP serves as a transient intermediate carrier of the aldehyde 
group (1). Thiamin pyrophosphokinase (TPK; EC 2.7.6.2) 
catalyzes the direct phosphorylation of thiamin with ATP to 
form TPP (1). The enzyme has been studied by many inves- 
tigators and purified from eucaryotic sources, human red 
blood cells (2), pig heart (3), pig brain (4), parsley leaf (5), 
and procaryote, Paracoccus denitnficans (6). In yeast, TPK 
was partially purified, and the enzymatic properties and ki- 
netics were well defined (7). However, neither the amino acid 
nor the nucleotide sequence of TPK has been known in any 
species. 

Recently, we isolated the thiSO mutant (8) which has a 
partial deficiency of TPK activity during the study on the 
regulation of thiamin metabolism in Saccharomyces cerevisiae. 
The thiSO mutant has the constitutive phenotype of thiamin 
transport, thiamin-repressible acid phosphatase (T-rAPase) 
encoded by PH03 gene (9), and enzymes involved in thiamin 
synthesis from 2-methyl-4-amino-5-hydroxymethylpyrimi- 
dine and 4-methyl-5-i9-hydroxyethylthiazole. The mutant also 
showed high resistance to oxythiamin, a thiamin antagonist 
whose potency depends on TPK activity (8). In the previous 
report, we clarified that TPP is a negative effector of the 
regulatory mechanism of thiamin metabolism in yeast. How- 
ever, it has not been clear whether the THISO gene encodes a 
structural gene of TPK or a positive regulatory factor specific 
for expression of a gene encoding for TPK. 

We report the isolation, mapping, nucleotide sequence of 
the THISO gene, and the evidence supporting that the THISO 
encodes for the structural gene of TPK by the expression 
experiment in Eschericlua coU, We also demonstrated by the 
gene disruption, analysis of THISO that TPK is the only 
enzyme capable of synthesizing TPP in yeast. In addition, the 
regulation of the THISO expression was investigated. 

EXPERIMENTAL PROCEDURES 

Organisms and Cultures — Table I shows the S. cerevisiae strains 
used in this study. Yeast cells were cultured at 30 °C in YPD medium 
(1% yeast extract, 2% bacto-peptone, 2% glucose) or in a defined 
medium containing a 0.67% yeast-nitrogeh base (Difco) supplemented 
with essential amino acids or Wickerham's synthetic minimum me- 
dium (12) supplemented with essential amino acids with or without 
thiamin. Yeast strains auxotrophic for thiamin, thi2 (13) and thi3 
(11) mutants, were cultured in the minimal medium with thiamm at 
a concentration of IQr^ M, which does not cause thiamin repression. 
The E. eoU strain MV1184 was used for the amplification of piasmids, 
and the bacterial cells were grown with shaking at 37 *C in LB broth 



^ The abbreviations used are: TPP, thiamin pyrophosphate; TPK, 
thiamin pyrophosphokinase; T-rAPase, thiamin-represBible acid 
phosphatase; ORF, open reading frame; IPTG, isopiopyM-thio-jJ-D- 
galactopyranoside; kb, kllobase pair(s); bp, base pair(s). 
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Table I 

Yeast strains used 



Strain 



Genotype 



Souice 



X2180-1A MA Ta SUC2 mal gal2 CUPl YGSC 

YPH499 MA Ta ura3'52 his3-C^0O Ieu2-M trpl'A63 ade2-l0l lys2'801 - Y. Ohya* 

YPH500 MATa ura3-52 his3'^00 Ieu2-Al ade2'l0l lys2-802 Y. Ohya* 

IFO104S2 MATa his4'519 gal2 Our stock 

T44-5A MA Ta thi80-l ura3-52 Ieu2-M Our stock 

T48-2D MATa thiSO-l um3-52 his3'£^200 Ieu2-M trpl-^ Our stock 

T36-2A MATa thi80-2 pho2-8 pha4-l trpl gal2 Our stock 

058-M5 MATathi2(pho6)gal4 Our stock (10) 

TRS3 MATathi3leu2-3J12gal2 Our stock (11) 

311/pSMC20-l* MATa SMP3::URA3 ura3-52 his3-A200 leu2'M trpl-^ ade2'101 lys2'80l This study 

YPH501 MATa/MATa ura3-52/ura3-52 leu2'AJ/leu2'Al his3'A200/his3'^00 trpl- Y. Ohya* 

A63/trp2-Ae3 ade2-101/ade2-]0l lys2-802 /lys2-801 
NKC2^ MATa/MATa THm/thiS6::UnA3 ura3-52/ura3'52 leu2'Al/leu2'M his3' This study 
A200/his3'A200 trpl 'A63/trpl-A63 ade2'101/ade2- 101 Iys2'801/lys2-801 

' y GSC is the Yeast Genetic Stock Center, University of California, Berkeley, CA. 
* University of Tokyo. 

' Constructed by transformation of YPH499 with a Kpnl fragment of pSMC20 (representative of 311/pSMC20-l, -2, and -3). 
Constructed by transformation of YPH501 with a Spel/Xhol fragment of pAN8.thiaO::URA3. 



with ampicilUn at a concentration of 50 fis/al. JM105 was used as a 
host strain for expression of the THI80 gene. 

Plasmids and DNA Manipulation — A gene library of 5. cerevisiae 
constructed by partial digestion of the genomic DNA with Sau3AI 
and its ligation with YEpl3 (14) at the unique BamRl site was 
obtained from the American Type Culture Collection (Rockville, 
MD). The shuttle vector pRS316 (15) was utilized for expression in 
yeast and subcloning of the TH180 gene, and YIp5 (16) was used as 
an integrative vector. A plasmid pSMC20 (a gift of Kenji Irie, Nagoya 
University) bearing the SMP3 gene (17) was used for the plasmid 
eviction method to isolate the THI80 gene. Fig. 1 shows the structure 
and construction of the principal plasmids. Yeast chromosomal DNA 
for Southern blot analysis and plasmid DNA were prepami as de- 
scribed by Hoffman and Winston (18). Yeast transformation was 
carried out by the method of Ito et al. (19). Bacterial transformation 
and preparation of plasmids and other routine DNA manipulations 
were performed as described by Sambrook et al (20). 

Genetic Analysis—Mating, sporulation, dissection, and scoring of 
genetic markers in yeast and calculation of map distances with tetrad 
data were carried out by the standard procedures for yeast genetics 
(21). The T-rAPase activity of thiSO mutants on plates was detected 
using the staining method based on the diazo-coupling reaction (22). 

DNA Segijencing— The DNA sequence was determined on an 
Automated Laser Fluorescent DNA Sequencer (Pharmacia LKB Bio- 
technology Inc.) by dideoxy chain termination (23) using an 
AutoRead Sequencing Kit In addition to the fluorescently labeled 
M13 Universal and Reverse primers, further oligonucleotides, for 
walking primer sequencing were synthesized and fluorescently labeled 
with the Gene Assembler Plus (Pharmacia) according to the manu- 
facturer's manual. The subcloned plasmid pAN7 (Fig. 2) was purified 
by equilibrium centrifugation in CsCl/ethidium bromide gradients 
(20) and used as a denatured double-stranded DNA template. The 
sequencing was performed, twice for each primer, on both orienta- 
tions. Nucleotide and deduced amino acid sequences were analyzed 
with programs from a GENETYX software package (Software De- 
velopment, Tokyo). 

Expression and Preparation of Recombinant Protein— An expres- 
sion vector pTrc99A (24) carrying the trc promoter, the lacZ ribo- 
some-binding site, the rmB transcription terminators, and the mul- 
tiple cloning site of pUClS was used. Two oligonucleotide primers 
were synthesized to amplify the intact THISO open reading frame 
(ORF) by the polymerase chain reaction (Perkin-Elmer Otus Instru- 
ments). The 5'-end primer FT3 is • CCGAATTCATGAGCGAG- 
GAGTGTATTGA, corresponding to nucleotides 1,428-1,445 with the 
^coRl linker immediately upstream of the THISO ATG, and the 3'- 
end primer RTl is GCTTCAGAGGTCGACATCCC, corresponding 
to nucleotides 2,443-2.462 holding 5afl sites (the nucleotide number 
refers to Fig. 3fi). A plasmid pAN7 (Fig. 2) was used as a template 
DNA, and conditions of polymerase chain reaction were: 94 *C for 1 
min, 50 'C for 1 min, and 72 "C for 2 min. After 30 cycles, the product 
DNA was purified using Magic"* PGR Preps (Promega). The 1,045- 
bp product designated HJ-F3R1 containing the THISO ORF was 



digested with EcoBl and Sail and subcloned into the EcoBl and Sail 
sites of pTrc99A. E. coli JM105 harboring pTrc99A-THI80 was grown 
at 37 'C to an absorbance of Aeoo = 0.8 in . 100 ml culture, and 
isopropyl-l-thio-/3-D-gaIactopyranoside (IPTG) was added to a final 
concentration of 0.1 mM. Bacterial cells were fiirther shaken at 28 *C 
for 10 h. The harvested cells were washed once with 30 mM Tris-HCl 
(pH 7.5) and 30 mM NaCl and resuspended in 20 ml of the same 
buffer, followed by disruption (100 watts, 5 min) with an ultrasonic 
oscillator (Kubota, Tokyo). Cell debris was removed by centrifugation 
at 20,000 X ^ for 30 min, and the supernatant was served as a crude 
extract 

Gene Disntption^The 1.1 -kb HindJU fragment containing yeast 
URA3 gene of YEp24 (25) was inserted into pAN8 (Fig. 2) at the 
unique i/mdIII site within the THISO coding frame to yield pAN8- 
thi80::URA3. The linear 4.2-kb Spel-Xhol fragment of this plasmid 
was used for the transformation of diploid yeast YPH501. Ura* 
colonies were selected, and gene disruption was confirmed by South- 
em blot analysis. 

Bbt Analysis — Southern blot analysis was performed as described 
by Southern (26). Yeast genomic DNA was digested with £coRI, 
subjected to electrophoresis in a 1% agarose gel, and blotted onto a 
Hybond-N+ membrane (Amersham). For Northern blot analysis (27), 
total RNA was isolated from yeast cells by the method of Schneider 
et al (28). Samples were subjected to electrophoresis in a 1.2% agarose 
gel containing formaldehyde and blotted onto a Hybond-N+ mem- 
brane. Both hybridizations were carried out according to the manu- 
facturer's manual. ^P-Labeled DNA probes were prepared by the 
random primer labeling procedure (29) using fa-'*P]dCTP (3,000 Ci/ 
mmol) (ICN Biomedicals). 

Enzyme Assays— TPK activity was measured by the procedure 
described previously (8). Briefly, the 1.5 ml of assay mixture contain- 
ing 0.02 M Tris-HCl (pH 8.6), 1 mM thiamin, 1 mM ATP, 2 mM 
MnS04, and the enzyme source was incubated for 15 min at 37 "C. 
The reaction was stopped with 0.3 ml of 30% trichloroacetic acid, and 
after centrifugation the acid was removed twice by extraction with 
double volumes of ethyl ether. The amount of thiamin and thiamin 
phosphates in the sample solution were determined by high-perform- 
ance liquid chromatography after conversion to the corresponding 
thiochromes (30). Protein was determined using a protein assay kit 
(Bio-Rad) with bovine serum albumin as the standard. 

RESULTS 

Clonini and Mcpping of THI80-~The THISO gene was 
cloned on the basis of its ability to suppress the thiSO mutant 
phenotype producing T-rAPase in the synthetic medium con- 
taining 5 X 10"^ M thiamin (8), in which medium yeast 
phosphate-repressible acid phosphatases are completely re< 
pressed (31). A thiSO mutant, T48-2D (thiSO-l leu2), was 
transformed with the yeast genomic DNA library and plated 
on the agar medium containing thiamin without leucine. The 
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plates were incubated at 30 for 3 days, and a total of 20,000 
Leu**^ colonies were examined for the T-rAPase activities by 
staining. Sixteen colonies did not show T-rAPase activity, 
and by the plasmid-curing experiment (32) two isolates were 
confirmed to have a plasmid complementing the thiSO muta- 
tion. These plasmids were prepared from the white (T- 
rAPase") Leu"*" transformants and used to transform E. coll 
MV1184 to Amp'. Both plasmids obtained from each Amp' 
transformant conferred the T-rAPase' phenotype on yeast 
strain T48-2D (thiSO-1 leu2) in roinimal medium containing 
thiamin. In addition, T48-2D bearing these plasmids lost the 
resistance to oxythiamin at the concentration of 10'® M. The 
obtained two plasmids could be identical based on their re- 
striction maps, and this plasmid, designated pHJ (Fig. 1), had 
an insertion of a 6.1-kb fragment in YEpl3. Transformant 
carrying pHJ exhibited restoration of the TPK activity and 
the TPP formation in vivo (Table II). To determine whether 
the pHJ contained the THI80 gene, we performed site-di- 
rected integration of the DNA fragment into the yeast chro- 
mosome. A 7.3-kb Nhel'Eagl fragment of pHJ, which carried 
Y£pl3-derived DNA at both sides of the inserted 6.1-kb 
fragment, was subcloned into the NhehEagl gap of YIp5 to 
generate pNESO (Fig. 1). The recombinant plasmid was line- 
arized at the unique Xhol site in the cloned fragment and 
transformed T48-2D (thiSO-l ura3). All Ura^ transformants 
showed the wild-type phenotype for T-rAPase production. 
The stable transformant was crossed with the wild-type 
strain, IFO 10482, and the resultant diploid was subjected to 
tetrad analysis. Of the 12 tetrads tested, all segregated 0-h:4- 
for T-rAPase production on the medium containing thiamin. 
These findings indicate that pNEdO was integrated at or close 
to the thiSO locus and that the cloned DNA carried the THI80 
gene. 

Oil the other hand, when we constructed a thiSO host strain 
used in the transformation experiment, we found a significant 
linkage of thiSO to ade2 and his3 on the right arm of chro- 
mosome XV. Tetrad distribution (Le. parental di- 
typernonparental ditype:tetratype) of ade2 and thiSO was 
47:0:18 and that of his3 and thiSO was 42:4:79, indicating that 
the thiSO locus is located between the ade2 and his3 loci and 
near the ade2 locus. Recently, smpS mutation (17), which 
confers stable maintenance of pSRl (a plasmid of Zygosac- 
charomyces rouxii) in S. cerevisiae host, has been mapped 
between the ade2 and his3 loci and close to the RPB2 locus 
(34). Then, we examined the linkage of smp3 and thiSO. To 
monitor the segregation of SMP3 gene by scoring Ura^ phe- 

Table II 

TPK activity and intracellular concentration of thiamin and thiamin 
phosphates of thiSO mutant (T48-2D) and its transformants 
TPK activity was determined using the crude extract as a enzyme 
source obtained by sonic oscillation procedure as described (33) from 
the yeast cells grown in thiamin-omitted medium. Intracellular con- 
tents of thiamin and thiamin phosphates of the ceils grown in the 
medium containing 5 X 10^^ M thiamin were measured as previously 
reported (8). 



Strain 


Plasmid 


TPK activity 


Intracellular 
concentration 

lliiamin TMP" 


TPP 






nmol/min/mg protein 








X2180-1A 


None 


0.27 


344 


16 


208 


T48-2D 


None 


0.07 


1497 


12 


61 




YEpia" 


0.08 


1385 


11 


60 




pHJ 


1.51. 


353 


27 


348 



'TMP is the abbreviation for thiamin monophosphate. 
^ 2 -Aim-based multicopy vector used for the construction of the gene 
library. 




Fig. 1. Stmctures and constructions of principal plasmids. 

pSMC20 consists of an £coRI-Hi7idIII fragment containing the SMP3 
gene and the large EcoRl-Hindlll fragment of YIp5. Procedures for 
constructions of pHJ, pYK2, and pNE80 are described in the text. 
Thick lines indicate the chromosomal DNA of S. cerevisiae. Abbre- 
viations of restriction sites: C, Cial; E, EcoKl; Eg, Eagl; H, Hindlll; 
K, KpnJ; Nh, Nhel; Pstl; Sa, Sail; Sp, Spel; X, Xhol; Xb, Xbal. B/ 
S is the junction site of BomHI and 5iiu3AI. 

notype, we integrated a plasmid pSMC20 into the SMP3 locus 
of the ura3 mutant, YPH499 {MATa ura3 THISO), by trans- 
formation with linearized pSMC20 DNA at the Kpnl site 
located in the SMP3 coding region. Three independent stable 
Ura* transformants (311/pSMC20-l, -2, and -3) were chosen 
and crossed to T44-5A {MATa uraS thi80-l). The results of 
tetrad analysis with the resulting diploids indicated that the 
thiSO locus was located 3.3 centimorgans proximal to the amp3 
locus (Table III). Since the relationship of genetic and phys- 
ical distances of chromosome XV has been reported to be 3.1 
kb/centimorgan (36), the THISO locus is about 10 kb from 
the SMP3 locus. The value of the estimated physical distance 
moved us to try to clone the THISO gene by the plasmid 
eviction method (37), and one plasmid (pYK2) was rescued 
from the Sofl-digested chromosomal DNA prepared from the 
transformant, 311/pSMC20-l. Then, T48-2D {thiSO-l ura3) 
was transformed by pYK2, and seven out of eight resulting 
Ura* transformants showed wild-type phenotype for T- 
rAPase production on thiamin containing medium. Therefore, 
we concluded that pYK2 carried the THISO gene. As shown 
in Fig. 1, the pYK2 contains the SMP3 gene and an additional 
7.9-kb fragment' on the opposite side of the RPB2 locus. A 
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Table III 
Meiotic mapping of thiBO nmtation 



Gene combination and tetrad distribution" 



Hybrid diploid 




thi80'SMP3::URA3 




thi80-ade2 






ade2-SMP3::URA3 


PD 


NPD 


T Distance* 


PD 


NPD 


T 


DisUnce* 


PD 


NPD 


T 


Distance'* 








centi' 








centi- 
















morgan 








morgan 








morgan 


311/pSMC20-lxT44-5A 


16 


0 


1 


13 


0 


4 




12 


0 


5 




31l/pSMC20-2xT44-6A 


13 


0 


2 


u - 


0 


4 




10 


0 


5 ■ 




311/pSMC20-3xT44-6A 


14 


0 


0 


12 


0 


3 




10 


0 


4 




Total 


43 


0 


3 3.3 


36 


0 


11 


12 


32 


0 


14 


15 



* PD, parental ditype; NPD, nonparental ditype; T, tetratype. 

* Calculated by the Perkins equation: centimorgan = 100(T + 6NPD)/2(PD + NPD 4- T) (35). 



Plasmid 
pHJ 



B/S 



pYIC2- 

pANI 
pAN2 
pAN3 
pAN4 
PANS 
pAN6 
pAN7 
pAN8 
pAN9 



Compte- 
mentation 



+ 
+ 
+ 
+ 

+ 



Fig. 2. Restriction maps of 5. cereviaiae DNAs cloned on pHJ and pYK2 and subcloning of the fragments with the ability 
to complement the T-rAPase constitutive phenotype of a thiSO mutant. Thick lines at the top two plasmids indicate the cloned yeast 
DNA, and the dotted line of pYK2 shows the DNA fragment o(SMP3 gene. Thin lines of pAN plasmids are the subcloned fragments on the 
plasmid, pRS316, and each insert has a multicloning site at each end. The triangle on the fragment of pAN9 indicates the site modified on 
cleavage with /findlll, followed by treatment with Klenow fragment and religation. The ability of each plasmid to complement the thiSO 
mutant phenotype is indicated in the right-hand column: complementation; no complementation. Abbreviations of restriction sites are 
the same as in the legend to Fig. 1. 



restriction map of the additional fragment was constructed 
and compared with that of the plasmid isolated from the gene 
-library, pHJ. The end regions of both maps were similar, 
suggesting that the THISO gene is located within the over- 
lapped region. - 

Nucleotide Sequences of THISO^To define the limits of the 
THISO gene, various subclones were constructed using the 
pRS316 vector from the cloned plasmids pHJ and/or pYK2, 
and yeast tiansformants carrying the subcloned plasmids were 
tested for their ability to complement the T-rAPase consti- 
tutive phenotype of the thiSO mutant (Fig. 2). pAN5 contain- 
ing only the overlapped region of pHJ and pYK2 suppressed 
the phenotjrpe of the thiSO mutation, and the disruption of 
the ffindlll site within this region abolished the complemen- 
tation (pAN9). Therefore, we inferred that the overlapped 
1.5-kb SaxjZPd-SaJl region contains the THISO gene. 

Then the nucleotide sequence of the 2.6-kb SpcI-EboRI 
fragment, including the overlapped 1.5-kb region, was deter- 
mined using pAN7 as the DNA template by the primer 
walking method (Fig. ^A). As shown in Fig. 3B, an open 
reading frame starting at position 1,426 and ending at 2,382 
was found in the 1.5-kb SauStM-SaJl region. A putative TATA 
box, TATAAAA, was found at positions 1,323-1,329 in the 
5 '-upstream region, whereas the polyadenylation signal AA- 
TAAA was not foimd in the 3'-noncoding region. We assumed 
that TAA-{19 bases) -TATG- (60 bases)-TTT starting at 2,418 
and ending at 2,496 is a transcription termination signal, 
since it resembles TAG. . .TA(T)GT. . .TTT proposed by 
Zaret and Shermen (38). The yeast intron splice signal, TAC- 
TAAC (39), was not found. The predicted protein encoded by 
the THISO ORF contains 319 amino acid residues with a 
calculated molecular weight of 36,616. We could not find a 



homolog of THISO in the NBRF database, indicating that the 
THISO protein is a new protein. 

Expression of THISO ORF in E, coli— To ascertain whether 
the THISO gene encodes the structural gene of TPK, we 
attempted to express the THISO protein in E. coU, in which 
TPK dose not exist, and TPP is synthesized from thiamin 
monophosphate by thiamin-monophosphate kinase (EC 
2.7.4.16) (40). The polymerase chain reaction product (HJ- 
F3R1) containing the complete THISO ORF with EcoRl site 
immediately upstream of the THISO ATG and Soil site located 
in the 3'-noncoding region was prepared and ligated between 
these restriction sites into expression vector, pTrc99A. The 
resultant translational initiation sequence is as follows. 



Met Glu Phe Met Ser Glu 
5' . . . AGGAAACAGACC ATG GAATTC ATG AGO GAG. . 

EcdRl 

(Sequence 1) 



.3' 



The expected fusion protein has an Af, of 37,023 and carries 
three vector-encoded amino acids (Met, Glu, Phe) at its N 
terminus followed by 319 THISO-specific amino acids. Bac- 
terial cells harboring the plasmid were cultured at 28 'C with 
or without 0.1 mM IPTG induction, in which condition the 
fusion protein could be obtained in the soluble fraction of ceil 
extract without accumulation of protein aggregates. Fig. 4A 
shows the patterns of SDS-polyaciylaioide gel electrophoresis 
of the crude extracts of the two independent pTrc99A-THI80 
transformants. A 36-kDa band, which was almost consistent 
with the expected size of the fusion protein, appeared in both 
IPTG-induced cells {lanes 3 and 4). 

Then the TPK activities in the cell-free extract were deter- 
mined after the samples were dialyzed sufficiently. Fig. 4B 



17444 



Yeast Thiamin Pyrophosphokinase Gene 




B 

AaACnOCT AGTGAACRA TTGOCTTGOC AMiTGMCAA AACTCTTT6T TTCATACTM 60 

GMAATTM^ AaTCCACAG TCTAT606IT TaAACAAAA TATHAMCT 0CCTT6MAT 120 

CfiUGmCT 6ACGACT0QC nOGCSATGT ICTMAACAA MTAAT0006 ATATTAOGM 180 

GTBCTTGATG CACTTGCAGr TI IGmOm TGATACTGA6 GCTGATTT6A TTTCATCCAA 240 

GMCNSACTA CCG6T0CTAA CATOUCTCT TGGTICMCC CTTAAAGATA TATCCCAAn 300 

AACTGATCTA aCTCCATAA ATGATGTTAT TG6TCAAGCA ACGCTCAATC GTTCCATGGT 360 

TACMMGAA ATAGATTaA GCACAATGAC OOCAGAAAAA OTTAATACTT TCCAQGATCA 420 

AAATCTGGAT GATGAAATGA AMTAAAAH 7GATTACCT6 ATAGATTATA AATTGCAm 480 

AAATGMCCC AATAGACAAC GGHAnACC GmCAATTG AATAmAOC AACACATCCA 540 

AGMCAATTC GAAGCTCGGT AnCCTAHn TAGAGAAOCA AATCATAGGC TTGATAACGA 600 

ATACCnCn AACAGOna AAAAAATGAC AGAATCTACA nAAATTTTT TGG0C1CAA6 660 

AATAaCAAG TA1GA0CATC TICWTCGGC AACAAGGACG AGAAAHOGA AGAAAATTTC 720 

TGATATITTG AAOCAATTCI AA06AATTTA TAATGATGAA AaCTGAAOG AAAATQCOGA 780 

GATTCATCTG C7GAGOG0CA CAACTCAACA AATTAAAGCG GAGATCAATC anui llll 840 

TGMATCOCT AAAAGTGACG CCAATGTCAA GAATGAAAAC AAACAAATAT irGAGTTGCA 900 

TA6TGAAAAT 6TATCAGAAA GACGATACAA AGAmOGTC TATCACTTGT CACAAGAGGC 960 

TGTCTTGAAA AACGTATGGT TCAACCaGA TOCGAOCATA 6TCGTGAGAA AATOGGAGCA 1020 

TTraCACTCA GGCrrnCAA AGAACAAATA ACACTGATAT GAAATTGG6T OCATOGHAC 1080 

AGGAACA06T AlCTATAATA AATATAOCAT 0GTA6TGCCG GGAAATATAC TAC1ACTTTA 1140 

TATATCAATA AAnCTGCAA OCTIGCAnA AGCAAAT6CA OCACAATOCT AAATTOCTTC 12Q0 

TAATA6TTGC CnTATTTOC TCATA6TGCA TATTTCAAAA CAQCGAAAAA TGCAAAATAT 1260 

OCGAAAACTT GAAAAAAAAT GQDGATGGG AATTACAAAT AAAGCCTGAA TGATAAOGCG 1320 

TTXfiQittU ATTTAOCnc TTAAGTAOti TGCTTTTGAT AAAAGCTM> AAnTGTAGA 1330 
TCACAATTGG TCACAAAGAA CAATAAAAAG CTGAATATCA CTXTATC^ ^^^"^^^ 

f TTgA^IJ CTgAAJGTJT Ti^TTgGT AC^ 1500 

«Tr?TWWS«S*W3^*T^rW' ,560 

m^swii'w^wTTf^w^ 1620 

^M^UC^^IJ Cj^/^ f^TT^ AMCTAC/^ '^^^Pf^ 

JAmCj|C<^ AGjp(^jJA I860 

m^^J'TWWI^^JW^JW^ 1980 

JTCTAT^T TAJCCgAGjA Tp^AT pa^^ 2100 

rWl'*i'Tii'T?^rTr^^'T^'fTif^ 2160 

ACrpjTfTAgGAAATjCTgGC^Gjr^ 2220 

pigGTjrA|JAGTGGgATJTT^^ 2280 

^pAACA CAnyrgC Q^TWTJCT IGT^ 2340 

rrWAl^Ag AAATAnCJT TJATAAATTA f T/^IJTTT TGTAAAGTAA OGCATGTGn 2400 

CTTTAACTAT TCGAOCGIMl TCTTGCTTTC GTCATCATCLilQPGATGTC GAOCTCTGAA 2460 

OCTAAACTAT CA6TAATCGG TGAGAATCAG GAGHIATGA TGGAAACGGC ATGAATTC 2518 

Fig. 3. Sequencing of 7H/dO. A, sequencing strategy. Qper»6ox 
indicates the position and direction of the THI60 ORF. Arrows show 
Hhe direction and approximate extent of sequence detennination by 

primer walking. Abbreviations used are the same as in the legend to 
Fig. 1. THI80 sequence. The predicted amino acid sequence is 
shown in a one-letter code below the nucleotide sequence. The putative 
TATA box is underlined, and the wavy underlines indicate possible 
signals for transcription termination. 



shows the chromatogram of thiochromes from thiamin phos- 
phates in the assay mixture for TPK activity. Marked TPK 
activity was detected in the IPTG-induced cells (Le, 43.2 nmol 
of TPP produced per min/mg of protein), whereas no enzyme 
activity was detected in the uninduced cells (panels 2 and 5). 
When thiamin monophosphate, toward which the TPK of 
yeast is completely inactive (1), was used as substrate in place 
of thiamin, TPP formation was not observed in the IPTG- 
induced cells (data not shown). Moreover, potassium ions, 
which are required for the thiamin-monophosphate kinase 
activity of £. coli (40), were not contained in the mixture used 
for TPK activity. These results clearly indicated that TPK 
were produced in the IPTG-induced cells carrying pTrc99A- 
THI80 and that the THI80 gene of S. cerevisiae encodes a 
structural gene of TPK. 

Disruption of THISO^V/e disrupted the corresponding gene 
locus to confirm the identity of the cloned gene further. The 
URA3 fragment was inserted into pAN8 at the unique HindlU 
site within the THI80 coding region to create pAN8- 
thi80::URA3. The disrupted thiSO fragment was excised from 
the plasmid and introduced into ura3/ura3 diploid YPH501 
(Fig. 5A). DNA was extracted from a Ura* transformant 
(NKC2) and simultaneously digested with EcoRl, the result- 
ing fragments were separated by agarose gel electrophoresis, 
and Southern blot analysis was carried out using HJ-F3R1 as 
a probe. Fig. SB indicates that the THISO gene has been 
correctly disrupted. NKC2 was a heterozygous diploid carry- 
ing the wild-type and disrupted alleles of THISO (lane 2), 
whereas the parent YPH501 contained only the wild-type 
allele {lane 1 ). When NKC2 was dissected, less than two spore 
clones survived as shown in Fig. 5C, and all the spore clones 
were Ura"". This finding demonstrated that the THISO is an 
essential gene for growth, and therefore, proved that TPK is 
the only enzyme capable of synthesizing TPP in S. cerevisiae. 

Regulation of THISO Expression — We have already reported 
that the activities of T-rAPase and the enzymes involved in 
thiamin biosynthesis in S, cerevisiae are coordinately re* 
pressed by exogenous thiamin (13). We also elucidated that 
expression of these genes is controlled by two positive regu- 
latory genes. THI2 and THI3, which might be negatively 
regulated by the intracellular TPP. level via some gene(s) (8, 
11). To determine whether expression of the THISO gene is 
regulated in a similar manner to the enzymes involved in 
thiamin metabolism, the effect of thiamin in the medium on 
the abundance of THISO mRNA and TPK activity was inves- 
tigated using the wild-type strain, thi2, thi3, and thiSO mu- 
tants. Cells were cultured imder either low or high concentra- 
tion of thiamin, total RNA was extracted, and then Northern 
blot analysis was performed using THISO, PH03, and URA3 
DNAs as probes, and simultaneously the TPK activity in the 
soluble fraction from the cells was measured. THISO mRNA 
was detected as a single 1.2-kb band (Fig. 6A, i^per panel), 
which was almost consistent with the coding region. The 
abundance of THISO mRNA in the wild-type strain was 
reduced certainly by thiamin in the medium {lane 2, 2), 
whereas the thi2 and thi3 mutants showed low levels oi THISO 
mRNA (lanes 3-6). These levels of THISO mRNA were com- 
parable with those of the TPK activity (Fig. 6B), indicating 
that THISO expression in the wild-type strain, like the PH03 
gene, is controlled at the mRNA level by thiamin in the 
medium and requires the positive fSactors, THI2 and THIS. In 
thi80-2 mutant which showed the TPK activity of about one 
tenth of that in the wild-type strain (Fig. 6B), the THISO 
mRNA was expressed at the same level as that of the wild- 
type strain (Pig, 6A, lane 1,7), suggesting that the mutation 
site of thiSO-2 is in the coding region of the mRNA to disturb 
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Fig. 4. £xpre8sioii of THI80 ia eoli. The growth condition of the bacterial cells bearing the expression vector pTrcSdAiTHISO and 
preparation of crude extracts are described under "'Experimental Procedures.'* A, SDS-polyacrylamide gel electrophoresis* €iijde efeairts (18^ 
fig of protein) from two independent transformants grown in uninduced ((ones; 2 and and OPTG-induced [lanes 3 and 4^) cultures were 
analyi&ed by 12% SDS-polyacrylamide gel electrophoresis using the method of Laemmli (41) followed by Coomassie Blue staining, The 
position of presumed fusion TH180 protein is indicated by the arrows Lanes 1 and^ show mblecidar ^tiGmdard:ph3teins (Bii(^ 
6 (97,400)^ bovine iserum albumin (66,200), ovalbumin (45,000). carbonic anhydrase (31,000), ^bean trypsin Inhibitor SliSOOli lysoi^iTi^^ 
{14*400), B, chromatogram of ihiochromes from thiamin phosphates on high-perfortnance liquid chromatography; Cmd^ extracfe (pLO;;tg^o^ 
protein) from the uninduced (paneZ S) arid iPTG-induced= ipahel 3): ceils were incubated for 15 mxn at37 "C iri the 
100 jd ot the solution after conversion to the thidchromes as described under ''Experimental Procedures'* were ii]||ected!:6iat6 
Parud 1 indicates the chromatognun j&oin authentic thiamin pho^^^ies (100 pmol ifor each chemicals). TMP is the ablsxeNiwtion £l>ri^tl^^ 
monophosphate. 



Fig. 5. Disruption of THISO, A, 
disruption sitrategy. The procedure is de- 
scribed under **Experinieiital Proce- 
dures," The white bar with an izrrow 
indicates the codiiig region. Abbrevia- 
tions of restriction sites are the same as 
in the legend to Fig. h By confirmation 
of the eiqperted DNA change by South- 
em analysis. Lane 1, parental diploid 
YPH0O1; lane 2i transformed opioid 
HKCl A Hindlil digest of 
was^used as size markers and are shown 
pn the left side of the figure. C, tetrad 
analysis of NkC2. Dissection agar slab 
jCpntaining tetrad sets was placed on 
YPB and photographed after incubation 
at 30 for 3 days. ^ 
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the stability or function of the TPK polypeptide. Further- 
more, the decrease in the THI80 mRNA level by thiamin in 
iki3 and iHiSO-^i mutants was low Uane 4, 6, 8)y whidi 
was thought to be caused by an insufficient increasi^ in the 
intracellular concentration of the coxepressoir, TPP (8), due 
to the low levels of TPK activity in these mutants (Fig. 6JB). 
These findings suggested that the THI80 gene expression is 
regulated negatively by TPP, the product synthesized by 
TPKi However, there was a significant difference in the extent 
of repression by thiamin between PH03 and TH180 genes iii 



the wild-type strain (Fig. 6A» lane I, 2). Thiam^ in thei 
medium repressed almost cdmpietely PH03 mRNA in the 
wild-t^e strain, whereas the decrease m the THI80 w^E^A 
level by thiamin was not as marked as thiit oif 'JfiHi^ 
ih addy[tion» TPK activity was no longer decreased when -the 
wild-type strain was grown in the medium cohtaiiioitig 1()^ M 
thiiamin (data not shown). These findings suggested that the 
expression of the THISO gene is incpmpletely repressed by 
thiamih and the basal level expression is not under the control 
of the thiamih r^latory system. 
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Pic. 6. Regulation ot THI80, Yeast strains X2180-1A (wild 
type), 058-M5 ithi2), TRS3 (thiSh and T36-2A UhiSO-2) were grown 
in minimal medium containing 10^ (L) or 10*^ (H) M thiamin. A, 
Norfhem blot analysis. Total RNA was isolated and subjected to 
: Northern blot analysis (10 ^g/lane) as described under "Experimental 
Procedures/ Upper panel, die ^'P-labeled HJ-F3R1 containinig intact 
TH1&) ORF was used as probe. RNA molecular weight standards 
(0J6f^i.77-kb RNA Ladder, life Technologj^es, fnc.) are indicated on 
the rigfU, Middk panel, the 3.2-kb PM-BamHl fragment containing 
fiH03 gene was pirepared from piasmid pKB (4i2) and used as probe. 
Lower panel, the l.l-kb Mindin fragment of URA3 from piasmid 
YEp24 (26) was used as probe, B, enzyme assay. Cnidie extracta were 
prepared by sonication of yeast ceils (33), and TPK activi^ was 
determined as described under ^'Experimentol Procedures." 

DISCUSSION 

We have obtained two DNA fragments to complement the 
constitutive: phenoijTpe of T^iAPase otiheihiSO mutant: one 
firom a yeast gnomic l&raiy &ni the other by the piasmid 
<5vi?*ipn ni^tti<>d The site-directed'integration analysis of the 
E)NA fragment from the library revealed that the cloned DNA 
carries the THI86 gene and not a suppressor gene of thiSO, 
Our cloned fragment froin the library hybridized in a yeast 
chromosome blot to chromosome XV (data not shown)* This 
finding showis an agreement with the genetic mapping analysis 
which demonstrated that the thiSO locus is ioca&d 3;3 cenU- 
morgans proximal to the sn^ locus on the right arm of 
chromosome XV. On the other hand, the other DNA fragment 
obtained by the piasmid eviction method displayed that the 
about 8-kb downstream regions firom the SMP3 ORF were 
jsimiiar in the restriction map to the fragment from the gene 
library, indicating that the latter fragment ako contained the 
THI80 gene and that the THI80 ORF is located wnthin the 
Overlapped region of the two cloned fragments. 

TSie THISd gene revealed an ORF of 967 nucleotides en- 
coding a protein of 319 amino acids with a predicted molecular 
weight of '36,616. This is in good agreement with the identi- 
fication of a i.2-kb transcript by Northern hybridization 
analysis. The expression experiment of the intact THISO ORF 
in E. cvli indicates that the THI80 gene of 5. cerevisiae is a 
structural gene of TPK. When the synthesized DNA corre- 
sponding to nucleotides 1,498-1,517 with EcoRl linker was 
used as the 5'-end primer in place of the FT3 primer in 
polymerase chain reaction with 3' -end primer RTl, andi the 
resultant product DNA was lighted iii expression vector 
pTrc99A^ which was designed so that the second , methionine 
of intact THI80 protein foUows the vector-encoded amino 
acids» no TPK activity was detected in the extract of the 
triansformed E. coli cells induced by IPTG (data not shown). 
We, therefore, considered that the first methionine of the 
THT80 ORF is the initiator methionine; and the N-terminal 
sequence is essential for enzyme activify* We cannot Imd a 



homolog of the amino acid sequence predicted ^m the THISO 
ORF in the database. However, the sequence starting at the 
11th residue,r^-X-X-X-Gly- X-X-Leu-ne-Asn -lie-, |of the 
THISO protein^e^fimbie^ thiS ATP binding motifSBsen^ed in 
Of and p subunits of ATP synthase and other adenine nucleo- 
tide binding proteins (43). The reason why the truncated 
THI80 protein which lacks the first 24 reisiduies kspressed in 
the bacterial cells has no TPK activity is attributed to the 
loss of the presumed ATP-binding domain, althdugh stiicfies 
by means of site-directed mutagenesis and protein . engineering 
are required 

Purified TPK has been prepared from pig heart (3), pig 
brain (4), parsley leaf (5), and P. denitrifwans (6). Recently^ 
Egi et oL reported the isolation and purification of TPK from 
human red blood cells (2). All investigated TPKs had a 
dimeric structure expected for that from the plant. From the 
comparison of the elution volume of TPK iactivity witb, those 
of standard proteins by gel filtration analysis using Se]phac]^l 
S-200, the nK)lecidar weight of the fusion protein was esti- 
mated to be 7i2,000 (data not shown). These fmdings s^ 
that the native TPK of S, cerevisiae exists in the dimer form 
of identical subunits. Hamada (3) indicated that pig heart 
TPK fonns disulfide cross-linkages between two identical 
subunits. the predicted protein encoded by the TH|8^ ORF 
contains 8 cysteine residues, $oine of which inay be m 
in the -linkage between the TiflSO polypeptides 

The bioajnthesis of TPP in S. ixrevisiae involves thie in^ 
dependent formation of the pyrimidihe ahd thiazole moiety 
of thiamin, their condensation to thiamih monophosphate, 
and the subsequent hydrolysis of thiamin monophbspbat^ 
pyrophosphorylation of thiamin formed (ij. Although fee 
thiamin is certainly thought to be an pbHj^te intermediiate 
between &kmin monophosphate ani TPPj tdku^& (^^^^ 
gested the existence 6f ihe enzyme-catalyzing^ 
phoryiation of thiaznih inonopho^hate to form TPiF^ in yieast 
However, the gene disruption experiment in tbi^ 
ihdicated that TPP can be synthesized by nothing birt^^ TPK 
in Si cereuisiaei Previbus studies fi»m d^^ iiBd>oratb2y ha^^ 
shown that thiamin phosphates are hydbxyiyzed % T-r>^^^^ 
in the jperipiasmic space mto free thiamin which csm be: taken 
up by an active transport system in cerevkiae ^^'Vo 
confirm that yeast cells have no ability to transport 'iSP in 
phosphate form, we planted the spores dissected from the 
dipldid, NKC2, on YPD plates containing 10"^ M TPP instead 
of ordinary YPD inedyium in the cotfee oHhe 
tidn expeiiineni The result fcir eight tetrad9 tested was the 
same as that using the mjedlum without TPP (data hot 
shown), indicating that TPP itself cannot be taken i4> by 
yeast celiis and the THJSO ia ^mconditionally m essential gi^ne 
in S. cerevisiae. 

The fmdings obtained by Northern blot hybridization 
analysis iand enzyme assay suggested that the TBIiBd gene 
expression of S. cerevisiae b iregiilated mainly at the iiiRNA 
level by a process that requires the positive factors^ THI2 and^ 
THI3> and the negative factor, TiPP. However, the TPK seems 
to be expressed constitutively at a low level and incompletely 
repressed by exogenous thiamin, whereas the enzymes in- 
volved in thiamin biosynthesis, T-rAPase encoded by PH03[ 
and the thiamin tfanspprt system are almost completely re^ 
pressed by thiamini (10. These findings were cbinceiyable, 
because tiie enzyinatic formation df TPP by TPK encode^ by 
THISO k the only process Sfor supplying this coenzyme fiom 
thiamin in S. cerevishis: Recently; we localized the region 
responsible for Uie transcriptional activation of the Pti03 
gene in response to tliiamih in the medium (42); Although 
c6mparisoi£ land best alijgximeht oiT the 406''bp S'-^ixpstieam 
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noncoding region of the PH03 and THISQ genes showed 
approximately 52% homology, we have not yet determined 
the consensus sequence concerned in thiamin regulation be- 
tween these two genes. It is also unknown at present whether 
a change in THI80 mRNA abundance by intracellular TPP 
concentration in yeast cells is due to the regulation at the 
transcription of the THI80 gene or the degradation of the 
mRNA. To clarify the detailed mechanism of the regulation 
of THI80 expression, further studies at the molecular level 
are required. 
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